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ABSTRACT Microarrays cootaining 1046 human cDNAs 
of unknown sequence were printed on glass with high-speed 
robotics. Tliese l.O-cm^ DNA --chips" were used to quantita- 
lively monitor differenUal expression of the cognate human 
genes using a highly sensitive two-color hybridization assay. 
Array elements that displayed differential expression patterns 
under given experimental conditions were characterized by 
sequencing. The identification of known and novel heat shock 
and phorfool ester-regulated genes in human T cells demon- 
strates the sensitivity of the assay. Parallel gene analysis with 
microarrays provides a rapid and efTicient method for large- 
scale human gene discover^-. 

Biology has entered the genome era (IJ. Compleie genome 
sequences for all of the model oreanisms and human will 
probably be available by the year 2003 (2). Torrents of human 
expressed sequence tags (ESTs) provide a starting point for 
elucidarmg the funaion of tens of thousands of cognate genes 
(3). Genome analysis will provide insights into growth, devel- 
opmeni. differentiation, homeostasis, aging, and the onset of 
diseases ( 1-3). A detailed understanding of the human genome 
will require the impiemeniaiion of sophisticated methods for 
gene expression analysis and gene discovery. 

Recently, a microarray-based method for high-throuehpui 
monitoring of plant gene expression was described (4).' This 
'chip**-based approach involved using microarravs of cDNA 
clones as gene-specific hybridization targets to quantitatively 
measure expression of the corresponding plant genes (4. 5). A 
two-color fluorescence labeling and detenion scheme Vaciii- 
lated sensitive differential expression analysis of different 
plant tissues (4. 5). The efficiency of this approach for studies 
in higher plants suggested the use of thU method for human 
genome analysis (4-7). Here, we report the use of cDNA 
microarrays for human gene expression monitoring, bioloeical 
investigation, and gene discovery. 

iMATERlALS AND METHODS 

Human cDNA Clones. The cDNA librarv was made with 
mRNA from human peripheral blood Ivmphocvtes trans- 
formed with the Epsiein-Barr virus. Inscru >6do bp were 
cloned into the lambda vector AYES-R to generate lO'-lO* 
recombinants. Bacterial transformants were obtained by in- 
fecting £, coli strain JM107/AKC Colonies were piclced at 
random and propagated in a 96-well format, and minilvsate 
DNA was prepared by alkaline lysis using REAL preps 
(Oiagen. Chatswonh. CA). Inserts were amplified bv PGR in 
3 96- well format using primers (PAN132 5'-CCTC- 
TATACTTTAACGTCAAGG; and PAN133, S'-TTGTGTG- 
GAATTGTGAGQGG) complementary to the AYES 
polylinker and containing a six-carbon amino modification 
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(Glen Research. Sterling, VA) on the 5' end. PGR products 
were purified in a 96-welI format usine OlAquick columns 
(Oiagen). 

Microarray Preparation. Amino-modified PGR products 
were suspended at a concentration of 0,5 mg/ml in 3x 
standard saline citrate (SSG) and arraved from 96.well micro- 
liter plates onto siiylatcd microscope slides (GEL Associates. 
Houston) using high-speed robotics (4-7). A total of 1056 
cDNAs. representing 1046 human clones and 10 Arabidopsis 
controls, were arrayed in 1.0-cm- areas. Printed arravs were 
incubated for 4 hr in a humid chamber to allow rehydration of 
the array elements and rinsed, once in 0.2rr SDS'for 1 min 
twice in H:0 for 1 min, and once for 5 min in sodium 
borohydndc solution (1.0 g of NaBH4 dissolved in 300 ml of 
PBS and 100 ml of lOO^Tc eihanol). The arravs were submerged 
in H:0 for 2 min at 95"C transferred quicklv into 0.2^ SDS 
for 1 min. rinsed twice in H:0. air dried, and stored in the dark 
at 25'G. 

Fluorescent Probes. Tissue mRNAs were purchased 
(GLONTEGH). Jurkat mRNA was isolated as described by 
Schena & at. (4). Probes were made as described (4) with 
several modifications. The reverse transcriptase used here was 
Superscript II RNase H- (GIBGO). The Gy5-dGTP was 
purchased from Amersham. Each reverse transcription reac- 
tion contained 3.0 of total human mRNAS Ambidopsis 
control mRNAs were made by in vitro transcription of cloned 
HAT4. HAT22, and YesAt.23 cDNAs (4. 8. 9) usinc an RNA 
Transcription Kit (Siratagene). For quantitation, the mRNAs 
were doped into the reverse transcription reaction at ratios of 
1:100.000, 1:10,000. and 1 : 1000 (wt/wi) respectively. Following 
the reverse transcription step, samples were treated with 15 
of 1 M sodium hydroxide for 10 min at 3rG. then neutralized 
by adding 2.5 ^1 of 1 M Tris-HGI (pH 6.8) and 2.0 ^1 of 1 M 
HGI. Probe mixtures contained cDNA products derived from 
3 of total mRNA suspended in 5.0 of hybridization 
buffer (5x SSG plus Q.l'yc SDS). 

Hybridization and Scanning, Probes were hybridized to 
1.0-cm- microarrays under a 14 x 14 mm glass coverslip for 
6-12 hr at 60"C in a custom-built hybridization chamber (4-7). 
Arrays were washed for 5 min at room temperature (25X) in 
low stringency wash buffer (ix SSG/0.29r SDS), then for 10 
mm at room temperature in high stringency wash buffer (0,1 x 
SSG/0.29r SDS). Arrays were scanned in 0.1 x SSG using a 
fluorescence laser scanning device (4-7). fitted with a custom 
filter set (Chroma Technology. Brattleboro, VT). Accurate 
differential expression measurements (i.e.. final fluorescence 
ratios) were obtained by taking the average of the ratios of two 
independent hybridizations. 

Abbreviation: EST, expressed sequence tag. 

Data dcposiiion: The sequences reported in this paper have been 
deposited in the GcnBank data base (accession nos. U56654-US6660) 
TTo whom reprint requests should be addressed, e-mail: schcnafi' 
cmgm.sianford.edu. ^ 
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Cell Culture. Jurkai cells were erown in a tissue culture 
incubator OTC and 5^^ CO:) in RPMI medium suppiemenied 
with lOrc fetal bovine scrum. 100 of streptomycin per ml. 
and 500 units of penicillin per ml. Heat shock corresponded to 
a 4.hr incubation at AVQ. Phorbol ester treated cells were 
grown for 4 hr in the presence of 50 ng of phorbol 12-myristate 
13-acetatc (PMA> per ml. 
RNA Blotting. Dot blots were performed as described (4), 
DNA Sequencing. Sequences were obtained using the 
PAN132 and PAM33 primers and a 373A automated se- 
quencer, according to the instructions of the manufacturer 
(Applied Biosystems). 

Computer Graphics and Informatics. Pseudocolor represen- 
tations of fluorescent images were made with National Institutes 
of Health image software (version 1 J2). Software for differential 
expression representations was purchased from Imaging Re- 
search (St. Catherine s. ON. Canada). Sequence searches were 
made to the nonredundani nucleotide data base at the National 
Center for Biotechnology Information (NCBI) using Macintosh 
BLAST software. The EST data base was accessed via the World 
Wide Web (hitpi/wwu. ncbi.nlm.nih.gov/). 

RESULTS 

Gene Discovery and the Heat Shock Response. Microarrays 
were used to examine the heat shock response in cultured 
human T (Jurkai) cells. Control (3rC) and heat-treated 
(43'*C) cells were harvested and lysed. and total mRNA from 
the two cell samples was labeled by reverse transcriptase 
incorporation of fluorescein- and Cy5-dCTP, respeaivcly. In 
a second set of labeling reactions, the fluorescent groups were 
"swapped" such thai samples from control and heat-treated 
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samples were labeled with Cy5- and f luorescein-dCTP. respcc- 
nvely. Each pair of fluorescent probes was hvbridizcd to a 
1036-elcmeni microarray. The arrays were washed at hich 
sirmgenc>' and scanned with a confocal laser scanning device 
to detect emission of the two fluorescent croups. 

H>-bridi2ation signals were observed to >95'> of the human 
cDNA array elements, but not to am of the Ambidopsis 
negative controls (Fig. 1). Fluorescence intensities spanned 
more than three orders of maenitude for the 1(U6 arrav 
Clements surveyed (Fig. 1 ). Comparative expression analysis of 
heat shocked versus control cells in the two experiments 
revealed 17 array elements that displayed altered fluorescence 
ratios of ^:.0.fold (Figs. 1 and lAV Of the 17 putative 
differentially expressed genes. 11 were induced hv heat shock 
treatment and 6 displayed modest repression (Figs. I and lA ) 
To determine the identit>' of the heat-reculatcd ce"nes 
cDNAs corresponding to each of the 17 arrav elements were 
sequenced on the proximal and distal end. Data base searches 
revealed perfect matches for 14 of the 17 clones, and in each 
case proximal and distal cDNA sequences mapped to the same 
gene (Table 1). Of the 1046 human genes examined on the 
microarray. the five most highly induced in heat-treated celU 
were heat shock protein 90q (hsp90Q). dnaJ. hsp9O0. poKii- 
biquuin and i-complex poK-peptide-l (tcp-1) (Table 1), Three 
ot the 17 clones did not match any entry in the public data base 
though one of the clones (B7) exhibited sicnificam homology 
to an EST from Caenorhabditis elegans (Table 1). Each of the 
novel sequences (B7.B9) exhibited -2-fold induction (Table 1 ) 
and relatively low-level expression (Table 2). 

To confirm the microarray results. mRNA levels for each of 
the genes were measured by RNA blotting. Each of the genes 
that displayed heat shock induction, including the three novel 
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Table 1. Microarray elements corresponding to differentially expressed genes 



Clone 



Row 



Column 



Ratio 



Blast identity 



Bl 

b: 

B3 

B4 

By 

B6 

B7' 

B8 

B9 

BIO 

BlI 

Bi: 

Bl.-^ 
B14 
B15 
B16 
B17 
B18 
B19 
B20 

b:i 
b:: 
b:3 



24 

1 

32 
17 



14 

7 

12 
28 
14 
20 
30 
10 
13 
7 
21 
3 
1 



Accession no. 



21 
31 
8 

19 
8 

31 
4 

19 
5 
8 



7 
9 
12 



16 
19 
30 
26 
18 
30 
16 



0.5 
0.5 
0.5 
0.5 
0.5 
0.5 
2.0 
2.0 

2.4 

2.4 

2.5 

15 

2.6 

4.0 

5.8 

6.3 

2.0 

2.1 
^ 

2.6 

3,5 
19 



C>'C oxidase III 
0-Aciin 

CYC oxidase III 

CYC oxidase III 

CYC oxidase III 

3-Actin 

Novel' 

Nover 

Nover 

Polvubiquilm 

TCP-1 

Polyubiquiiin 

Polyuhiqumn 

HSP903 

DnaJ homoloc 

HSP90o 

HSP9l(o 

/J:-mtcroglobulin 
Nover 

3rmicroelobuhn 
PGK 
NF-kBI 
PAC-1 



J01415. J01415 
NR. X00351 * 
J01415, J01415 
J01415. J01415 
J01415. Jni415 
NR. \00351 
U 5 6653. U566.<14 
U56b55. U56656 
L'.'^6657. U56658 
XO4K03, XCWK03 
X52K82, ,\.^2RS2 
Ml 7507. M17.S97 
Xa4K03. XO4K03 
Ml 6660. Ml 6660 
D13.38«. D1.V^H8 
X07270. X07270 
M270:4. X15183 
S54761. M.'*0683 
L'56659. U56660 
S54761. M306S3 
Ml 1968. L00160 
Z47744, M55W3 
LI 1329. Ln.^29 



Clone narne. array position (Fig. 1). fluorescence raiio. sequence idcnt.iy. and accss.on number ol cDNAs thai manilcsicd 
a dillercmiaiexpre«ton pattern w.ih probes prepared from heat shock- {BM7) or phorbol csicr-ircaicd (818-23) Jurkai cells 

exc«?^ci",^d«. m?^ T »° 'd^"""* known sequences. All eenes arc nuclear 

except CYC oxidase III (mitochondrial). Accession numbers reflect the highest score for proximal and distal sequence traces 
respectively, CYC cytochrome c TCP-l. T-complex poKT>ept,dc: HSP. hea, shock protein: PGK. phosphoelvcera.e k na" 
Sa; ""tra"' factor-kappaB: PACK phosphaia5e of acmated cells; and NR. trace no, readable due to the presence o^ 

*B7 is 67rc identical to an EST from C. W<yoiij (D76026). 
*No match in the public data bases. 
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Table Human gene cxpreg.on mon.tored by microarray and RN'A blot analv^ 

Expression icvel per 1(K mRNAs 



Clone 


Blast identity 


Bl 
B2 


V I \v uxioasc III 
Act in 


B3 


CYC nviHac^ IIT 


B4 


^TL oxidase III 


B5 


uTv, oxidase III 


B6 


^ Act in 


B7 


Novel (weakly to D76026) 


B8 




B9 




DIU 


Polyubiquitin 


Dli 




B12 


Polyubiquitin 


B13 


Polyubiquitin 


B14 


HSP903 


B15 


DnaJ homolog 


B16 


HSP90o 


B17 


HSP90a 


B18 


^^microglobulin 


B19 


Novel 


B20 


^rmic^oglobulin 


B21 


Phosphoglycerate kinase 


b:: 


NF-KBl 


B23 


PAC-l 





Ratio 


RNA blot 


92/46 


OS 


100/80 


240/120 


OS 


270/280 


36/18 


OS 


ND 


76/38 


OS 


ND 


62/31 


OS 


ND 


180/89 


OS 


ND 


U/2.6 


2.0 


0 77/1 R 


2.0/4.0 


2.0 


IS/XA 


0.8/1.8 


2.2 


1.2/1.8 


0.8/1.9 


2.4 


1^/89 


2.3/5.5 


2.4 


7.1/27 


0.8/2.0 


2-5 


ND 


1.7/4,3 


2.5 


ND 


75/200 


2.6 


30/120 


1.0/4.0 


4.0 


1.6/13 


0.6/3-5 


5.8 


3.2/29 


0.8/5.0 


6.3 


8.6/62 


1.0/2.0 


2.0 


5.4/15 


U/2.5 


2.1 


4-5/9.5 


2.7/5.9 


2.2 


ND 


2.4/6.2 


2.6 


4.7/9.2 


1.7/6.0 


3„S 


0,65/4.7 


0.5/9.5 


19 


0.21/15 



Ratio 

as 

1.0 

ND 

ND 

ND 

ND 

2.3 

2.3 

IS 

3.6 

3.8 

ND 

ND 

4.0 

8.1 

9.1 

7.2 

2.8 

2.5 

ND 

2.0 

7.2 

71 



/t'' M ^'^^"^^^^ '"^^^ by dot blot analysis 

UaDle 2). In all cases, expression ratios as determined bv the 
two procedures differed by <2-fold for the genes identified in 
the heat shock experiments (Table 2). The two assavs differed 
more widely in terms of assessing absolute expression levels- 
nonetheless, absolute expression as monitored on a microarrav 
typically correlated with RNA blois to within a factor of five 
(Table 2). 

Phorbol Ester SisaaliniE. To explore a signaling pathway 
distinct from the heat shock response, microarravs were used 
to examine the cellular effects of phorbol ester treatment 
Jurkat cells were treated with phorbol ester, harvested Ivsed 
and used as a source of mRNA. Samples of mRNA from' 
untreated or phorbol esier-stimulated cells were labeled with 
reverse transcriptase. The probes were mixed, hybridized to 
microarravs. and scanned for fluorescence emission of the two 

^^n',";!"' ^ °^ elements displayed 

a2.0-fold elevated signals with probes from phorbol ester- 
treated cells relative to control samples (Fig. 2B) 

To determine the identity of the phorbol ester-induced 
genes, clones corresponding to the six array elements were 
sequenced. Data base searches revealed perfect matches for 
five of the su sequences fTable 1». The two most highly 
induced genes were the PAC-I tyrosine phosphatase and 
nuclear factor-kappa Bl (SF-kBI): modest activation was 
?T m"', . X' P'^'sphoglycerate kinase and ^.-microglobulin 
(Table I). One remaining clone (B19) did not match any entry 
m the public data base (Table 1). B19 displayed a il-fold 
induction and. similar to the novel heat shock 'genes, a rela- 
ively low absolute expression level (Tables 1 and 2). All sU of 
the phorbol Mter-inducible gsnes displayed increased steady- 
state mRNA levels Ijy RNA blotting (Table 2) PAC-l expres- 
sion (Fig. 1: Table 2) defined a detection limit of -1:500.000 
for the assay. 

u,hl^"*""-'" """"" ■^'»"«- To determine 

whether microarravs could be used to monitor expression in 
human tissues, probes were prepared from human bone mar- 



wirh rvTA?I?D • ^ each mRNA sample 

with Cy5-dCTP. In a separate reaction, a control probe was 
prepared by labeling Jurkat mRNA with fluorescein-dCTP 
The four CyS-labeled probes were each mixed with an aliquot 
of the f luoresccin-labeled control sample, and the four mix- 
tures were hybridized to separate microarravs. The arrays were 
washed and scanned for fluorescence emission, and hybrid- 
aation signals for each of the tissues samples weft normalized 
^, h ,n f°""°' '° generate an expression profile for each 
ot the 1046 clones present on the array. 

Detectable expression was observed for all 15 of the heat 
shock and phorbol ester-regulated genes in the four tissue 
types examined (Fig. 3). In general, the expression level of each 
gene m Jurkat cells correlated rather closely with expression in 
the four tissues (Table 2; Fig. 3). Genes encoding /3-actin and 
cytochrome c oxidase, the two most hichlv expressed of the 15 
genes in Jurkat cells (Table 2). were highly expressed in bone 
marrow, brain, prostate, and heart (Fig. 14). Expression of 
cytochrome c oxidase. hsp90a. and the novel B7 sequence was 
significantly greater in heart than in the other tissues (Fig. 3). 

DISCUSSION 
Many of the heat shock genes identified in this study encode 
ur^'.^'*'^' function either as molecular • chaperones" 
(HSP90a. HSP90fl. DnaJ. TCP-l) or as mediators of pToteln 
degradation (polyubiquitin). The identification of these se- 
quences is consistent with the biochemical basis of heat shock 
induction (10-15). Proteins undergo denaturation at elevated 
temperatures, and those that fail to maintain proper confor- 
mation must be selectively degraded (10-15). It will be inter- 
esting to determine whether the three novel heat shock- 
inducible sequences (B7-B9) mediate protein folding and 
luniover or possess some other biochemical activity. Complete 
nucleotide sequence determination, conceptual translation 
expression monitoring, and biochemical analysis should pro-' 
yide a detailed functional understanding of these genes 
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Phorbolcsier.apoieniartivaiorofproicmkinastfC(16. 17). 
induced a sei of genes distinct from those involved in the heat 
shock pathway. The most highly induced gene identified in this 
study. PAC'L encodes a nuclear tyrosine kinase that may plav 
a role in regulating transcription and cell c>cle progression 
(18). NF-icBl. a second phorbol cster-inducibic gene, is an 
intensively studied member of the Rcl transcription factor 
family (19-21). The Rel proteins arc activated by a large 
number of stimuli, including phorbol esters, cytokines, bacte- 
rial and viral pathogens, and ultraviolet light (19-21). Modest 
activation was observed for three sequences not known to be 
inducible by phorbol esters, including phosphoglyceraie ki- 
nase. 3:-microglobuIin. and a novel human gene (B19). Ex- 
tensive expression monitoring with microarrays should assist in 
understanding how each of these genes integrate into the 
highly complex phorbol ester signaling pathway. 

It is striking that four novel human" genes were discovered 
with an array of 1000 randomly chosen clones, particularly 
because the heat shock and phorbol ester signaling pathwavs 
have been so intensively studied (10-21). The facile discovery 
of these sequences underscores the fact thai microarrays can 
be used for gene discovery in the absence of any sequence 
information. By this approach, clones are chosen at random 
from any library of interest and only those clones that display 
interesting expression pauems are sequenced and character- 
ized. This parallel assay, coupled wiih a modest DNA sequenc- 
ing facility, allows high-throughput human genome expression 
analysis and gene discover)'. 

Genes that are activated or repressed by a given stimulus 
provide functional clues to the cellular pathway involved 
(22-24). Detailed examination of these gene expression '•sig- 
natures" can provide a dynamic view of the mode of aaion of 
a given signaling substance (22-24). Microarrays may thus 
allow rapid mechanistic examination of hormones, drugs, 
elicitors. and other small molecules; moreover, functional 
analysis of transcription factors, kinases, growth factors, cyto- 
kines, receptors, and other gene products should be possible. 
Effons are underway to develop mRNA amplification strate- 
gies to enable probe preparation from minute tissue samples. 
This capability might allow for high-throughput patient screen- 
ing in a clinical setting. 

The current detection limit of the assav allows monitoring of 
transcripts that represent *1:500.000 (wt/wi) of the total 
mRNA- This 10-fold increase in sensitivity compared with the 
original repon (4) was achieved largely by modifying the 
coupling chemistry, which reduced background fluorescence. 
The significance of this improvement is considerable in that 
approximately half the human genes identified in this study, 
including all four novel sequences, exhibited expression levels 
below the original detection limit of 1:50.000 (4). 

The ability to detect 2-fold changes in expression was 
achieved by the use of two-color fluorescence in the labeling 
and detection schemes, digitized data collection, and custom 
software. The importance of this capability is underscored by 
the fact that nearly alt of the genes examined here exhibited 
<6-fold changes in expression. The four novel genes, which 
showed £2.2-fold activation, were probably overlooked in 
previous screens that used conventional differential expression 
techniques. It may be possible to further improve the precision 
of the microarray assay by the use of closely related fluorescent 
analogs, such as Cy3 and Cy5. in the labeling and hybridization 
reactions. 

Microarrays offer a number of advantages over other po- 
tential high-capacity approaches to expression analysis. The 
chip-based approach enables small hybridization volumes, high 



Fig. 3. Transcripi profiles of heat shock and phorbol esicr- ^J^^ densities, and the use of fluorescence labeling and 

regulated genes. Gene expression levels per 100,000 mRNAs U-axcs) detection schemes. These features provide a set of pcrfor- 

arc shown for 1.1 genes (Table 1) in human bone marro* (red), brain mance specifications that are unattainable with filter-based 

(green), prosiaie (blue), and heart (yellow). Genes are grouped approaches (25. 26). The use of cDNA clones provides hy- 

according to expression levels {A-Q. bridizaiion specificity that is not readily attained with oligo- 
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nucieoiide arrays (27-30). The parallel formal of the assav 
^'Tr^ ^ simultaneous differential expression readout for 
>1000 genes. This contrasts with sequencine-based methods 
which require serial data collection for expression analvsis (31* 
3.). A commercial source of cDxNA microarravs would great N 
speed the use of a chip-based approach to expression analysis' 
The availability of large numbers of ESTs (3) provides a 'rich 
resource of human cDNA clones for microarravine The 
>400.000 ESTs in the public data bases represent a ii^^ifi«m 
subset of all human genes (3. 33). Microarravs of ihoiLnds of 
ESTs will provide a powerful analytical tool for future human 
gene cxpre^ion studies. The -100.000 genes in the human 
genome (2. 33) emphasize the need for microarravs of greater 
density. Attempts lo improve microdeposition techniques are 
underway and should allow construction of arravs comainine 
a complete set of human gene targets (http://ci;igm.stanforcl 
edu/--schcna/). Microarravs of -100.000 cDNA elements 
would allow expression monitoring of the entire human ec- 
nome in a smgle hybridization. Tliis capacity, coupled with 
detailed biochemical analysis of the individual gene products 
would greatly speed the functional analvsis oi the humTn 
genome. 
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